Discriminative Slot Detection Using Kernel Methods
نویسندگان
چکیده
Most traditional information extraction approaches are generative models that assume events exist in text in certain patterns and these patterns can be regenerated in various ways. These assumptions limited the syntactic clues being considered for finding an event and confined these approaches to a particular syntactic level. This paper presents a discriminative framework based on kernel SVMs that takes into account different levels of syntactic information and automatically identifies the appropriate clues. Kernels are used to represent certain levels of syntactic structure and can be combined in principled ways as input for an SVM. We will show that by combining a low level sequence kernel with a high level kernel on a GLARF dependency graph, the new approach outperformed a good rule-based system on slot filler detection for MUC-6.
منابع مشابه
Remote homology detection based on oligomer distances
MOTIVATION Remote homology detection is among the most intensively researched problems in bioinformatics. Currently discriminative approaches, especially kernel-based methods, provide the most accurate results. However, kernel methods also show several drawbacks: in many cases prediction of new sequences is computationally expensive, often kernels lack an interpretable model for analysis of cha...
متن کاملFeatures Extraction For Protein Homology Detection Using Hidden Markov Models Combining Scores
Few years back, Jaakkola and Haussler published a method of combining generative and discriminative approaches for detecting protein homologies. The method was a variant of support vector machines using a new kernel function called Fisher Kernel. They begin by training a generative hidden Markov model for a protein family. Then, using the model, they derive a vector of features called Fisher sc...
متن کاملEvaluation of Cardiovascular Disease Risk in the China Kadoorie Biobank Using Novelty Detection
We evaluate the risks of cardiovascular disease to the Chinese population by i) detecting ”abnormality” using 3 one-class classification methods (a discriminative one-class support vector machine (SVM), a generative kernel density estimate (KDE), and a discriminative KDE), and ii) predicting probabilities of ”normality”, arrhythmia, and ischemia using 3class classification method (a discriminat...
متن کاملAlignmentfreie Analyse von Proteinsequenzen mit Verfahren des maschinellen Lernens
Motivation: Remote homology detection is among the most intensively researched problems in bioinformatics. Currently discriminative approaches, especially kernel-basedmethods, provide themost accurate results. However, kernel methods also show several drawbacks: in many cases prediction of new sequences is computationally expensive, often kernels lack an interpretable model for analysis of char...
متن کاملThe Spectrum Kernel: A String Kernel for SVM Protein Classification
We introduce a new sequence-similarity kernel, the spectrum kernel, for use with support vector machines (SVMs) in a discriminative approach to the protein classification problem. Our kernel is conceptually simple and efficient to compute and, in experiments on the SCOP database, performs well in comparison with state-of-the-art methods for homology detection. Moreover, our method produces an S...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004